Overview of Web Archive Access


Web Archive Access - Wayback Machine (archive.org)


• View historic captures of websites (625 billion pages)
• Search by URL or keyword (not full text search)
• Includes Internet Archive-run crawling as well as Archive-It and other partner-directed crawls
Web Archive Access - archive-it.org

• Access to all public collections in Archive-It
• Browse by URL and search metadata and full text
• Browse by organization, collection or sites
[Screenshot: archive-it.org home page ("for collecting and accessing cultural heritage on the web", built at the Internet Archive), with collection and organization search boxes and three featured collections: Mormon Blogs Collection (Brigham Young University), Ukraine Conflict (Internet Archive Global Events), and Everglades Explorer - EAPRA (Florida International University Libraries)]


Web Archive Access - communitywebs.archive-it.org 


• Access to all public Community Webs member collections
• Browse by URL and search metadata and full text
• Featured Collection topics allow access to similar collections across members
[Screenshot: Community Webs home page and search results (page 1 of 33, 653 total results), with results narrowed by organization and by Featured Collections Themes such as COVID-19, Crises & Disasters, and LGBTQ+ Resources. Example results: #DaytonStrong (Dayton Metro Library) and #Syllabus (Schomburg Center for Research in Black Culture, New York Public Library)]

About this Program: Community Webs, a program of Archive-It and the Internet Archive, was launched in 2017. Its mission is to advance the capacity for public libraries and other cultural heritage organizations to build archives of web-published primary sources documenting local history and underrepresented voices. The program achieves this mission by providing resources for professional training, technology services, networking, and in support of scholarly research use.


Community Webs Access Site Demo 


Metadata Integration Updates 


What is the Digital Public Library of America (DPLA)?


Digital Public Library of America: https://dp.la/
• Launched in 2013
• Metadata aggregation
• Content Hubs (including Community Webs)




What Community Webs content will be included in DPLA?


• US-based Community Webs members
• Public Collections and their seeds
  ○ Must have at least a Collection level rights statement
• Member-created metadata, plus automated metadata as needed
• Pointers back to the archived content
• Updates on a regular basis (approximately quarterly): full replacement each time
• Launched late August 2022 with >4,800 items
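
The inclusion criteria above can be read as a simple filter. The following Python sketch is illustrative only: the `Collection` class and its fields (`country`, `is_public`, `rights_url`) are hypothetical names chosen for this example and are not the Archive-It data model.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Collection:
    # Hypothetical fields, assumed for illustration only.
    name: str
    country: str               # country of the member organization, e.g. "US"
    is_public: bool            # whether the collection is public
    rights_url: Optional[str]  # collection-level rights statement, if any


def eligible_for_dpla(c: Collection) -> bool:
    """Return True if the collection meets the three criteria listed above."""
    return c.country == "US" and c.is_public and bool(c.rights_url)


RIGHTS = "http://rightsstatements.org/vocab/InC-EDU/1.0/"
collections = [
    Collection("A", "US", True, RIGHTS),
    Collection("B", "US", True, None),     # no rights statement
    Collection("C", "CA", True, RIGHTS),   # not US-based
    Collection("D", "US", False, RIGHTS),  # not public
]
included = [c.name for c in collections if eligible_for_dpla(c)]
print(included)
```

Only collection "A" passes all three checks in this made-up sample.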


Collection Level Metadata in DPLA


○ Title (Collection Name)
○ Rights: rightsstatements.org or creativecommons.org URL
○ Link to resource: Archive-It collection page, e.g.:
  https://archive-it.org/collections/1506
○ Image:
  ■ Collection image or account image
○ Description
○ Subject(s)
○ Coverage, Creator, Language, Relation, Date


Seed Level Metadata in DPLA


○ Title
  ■ If no title exists, we grab the title from the live web HTML; failing that, we use the URL
○ Link to resource: Seed calendar page, e.g.:
  https://wayback.archive-it.org/8092/*/https://www.boisestate.edu/
○ Rights
  ■ Inherits from collection level if not set for individual seeds


Seed Level Metadata in DPLA - continued


○ Description
  ■ If a description exists, we use it; if not, we try to scrape one from the live site
○ Type
  ■ Interactive Resource (this is set by DPLA)
○ Format
  ■ If no metadata is provided, "Archived Website" is used
○ Collection (links to collection level metadata)
○ Coverage, Creator, Language, Relation, Date, if provided by you!
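
The seed-level fallbacks above (title, description, rights, format) can be sketched as one small function. This is a minimal illustration under stated assumptions, not the actual export code: the function names, the dictionary keys, and the `live_title` / `live_description` parameters (stand-ins for values scraped from the live site) are all hypothetical.

```python
from typing import Optional


def seed_calendar_url(collection_id: int, seed_url: str) -> str:
    """Build a seed calendar page link in the form shown on the slide."""
    return f"https://wayback.archive-it.org/{collection_id}/*/{seed_url}"


def build_seed_record(
    collection_id: int,
    seed_url: str,
    seed_meta: dict,
    collection_rights: str,
    live_title: Optional[str] = None,
    live_description: Optional[str] = None,
) -> dict:
    """Apply the fallbacks described above to one seed (hypothetical schema)."""
    record = {
        # Member title, else title from the live site, else the URL itself.
        "title": seed_meta.get("title") or live_title or seed_url,
        "link": seed_calendar_url(collection_id, seed_url),
        # Seed-level rights, else inherited from the collection.
        "rights": seed_meta.get("rights") or collection_rights,
        "type": "Interactive Resource",  # set by DPLA
        "format": seed_meta.get("format") or "Archived Website",
    }
    description = seed_meta.get("description") or live_description
    if description:
        record["description"] = description
    # Optional fields are passed through only when the member provided them.
    for field in ("coverage", "creator", "language", "relation", "date"):
        if seed_meta.get(field):
            record[field] = seed_meta[field]
    return record


record = build_seed_record(
    8092,
    "https://www.boisestate.edu/",
    seed_meta={"creator": "Example Creator"},  # made-up sample metadata
    collection_rights="http://rightsstatements.org/vocab/InC-EDU/1.0/",
)
print(record["title"], record["link"])
```

With no member title and no scraped title, the record falls back to the seed URL as its title and inherits the collection-level rights statement.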


Seed Level Metadata in DPLA - Image


Image: We try to get a representative image of the website by checking:

1. Does the site have a preferred image for Twitter?
2. Was the site crawled using Brozzler (which takes screenshots)?
3. Does the site have an easily identifiable logo?
4. Does the site have a favicon?
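
The four checks above amount to a priority-ordered fallback. A minimal Python sketch follows, assuming the candidate images have already been gathered into a dictionary; the key names and example URLs are hypothetical, and how each candidate is actually detected is not covered here.

```python
from typing import Optional

# Order in which candidate images are checked, following the list above.
IMAGE_PRIORITY = ("twitter_image", "brozzler_screenshot", "logo", "favicon")


def pick_representative_image(candidates: dict) -> Optional[str]:
    """Return the first available image URL in priority order, or None."""
    for key in IMAGE_PRIORITY:
        url = candidates.get(key)
        if url:
            return url
    return None


# Made-up example: no Twitter image or screenshot, but a logo and a favicon.
chosen = pick_representative_image(
    {
        "twitter_image": None,
        "logo": "https://example.org/logo.png",
        "favicon": "https://example.org/favicon.ico",
    }
)
print(chosen)
```

Here the logo wins because it ranks above the favicon and nothing higher in the list is available.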


[Screenshot: three example items in DPLA from the Schomburg Center for Research in Black Culture, each showing a representative image, title, creator, description, and a "View Full Item" link: a Chicago Sun-Times article ("CPS to resume food distribution program after safety concerns led to day-long suspension", Nader Issa), the Schomburg Education home page, and an NYPL blog post ("Using NYPL resources to enrich your African American History studies", Lynda Kennedy)]


Rights Statements in DPLA


• Must be expressed as a URL, rather than the name of the rights designation
  ○ For example: http://rightsstatements.org/vocab/InC-EDU/1.0/ rather than IN COPYRIGHT - EDUCATIONAL USE PERMITTED
• Must come from rightsstatements.org or creativecommons.org
  ○ https://rightsstatements.org/page/1.0/?language=en
  ○ https://creativecommons.org/about/cclicenses/
• Can include a local rights statement as well, but it cannot contradict the standard statement
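
The two rules above (a URL rather than a label, from one of two sources) lend themselves to a quick pre-check before metadata is submitted. The sketch below is an assumption about how one might test this locally, not a DPLA tool; it only inspects the scheme and host and does not confirm that the path names a real statement or license.

```python
from urllib.parse import urlparse

ALLOWED_HOSTS = {"rightsstatements.org", "creativecommons.org"}


def is_valid_rights_url(value: str) -> bool:
    """Check that a rights value is an http(s) URL on an allowed host."""
    parsed = urlparse(value.strip())
    if parsed.scheme not in ("http", "https"):
        return False
    host = (parsed.hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]
    return host in ALLOWED_HOSTS


url_ok = is_valid_rights_url("http://rightsstatements.org/vocab/InC-EDU/1.0/")
label_ok = is_valid_rights_url("IN COPYRIGHT - EDUCATIONAL USE PERMITTED")
print(url_ok, label_ok)
```

The URL form passes, while the plain-text name of the designation is rejected because it has no URL scheme.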


[Screenshots: the #Syllabus collection (Schomburg Center for Research in Black Culture) on the Community Webs access site, with 322 total results, Subject, Publisher, Creator, and Date facets, and sort options by title or URL; and the same #Syllabus Collection in DPLA, with 322 results of type "interactive resource" and filters for rights status, type, subject, and date]


DPLA Demo 


DPLA - Remaining questions and points for consideration 


• Default rights statement?
  ○ Copyright not evaluated: https://rightsstatements.org/page/CNE/1.0/?language=en
• Organization name in Archive-It vs. organization name in other DPLA hub(s)

Contributing Institution (items)            Partner (items)
San Francisco Public Library (17,402)       California Digital Library (16,482); Community Webs (920)
Athens Regional Library System (134)        Community Webs (134)
Athens-Clarke County Library (1,640)        Digital Library of Georgia (1,640)


Metadata Integration: OurDigitalWorld


[Screenshot: search.ourontario.ca home page (OurDigitalWorld), offering over 2.3 million items and collections from libraries, archives, museums, historical societies, community groups, and government ministries. Discover: people, places, events and objects about Ontario. View and access: photographs, maps, videos, oral histories, government documents, newspapers and more]


OurOntario.ca


• Search portal for multiple heritage organizations' collections
• ~300 heritage organizations are indexed
• Currently searches nearly 3 million items
• Ontario's largest online collection of newspapers
• Award-winning platform




OurDigitalWorld x Community Webs


• Ontario-based Community Webs content
• Opt-in participation
• Metadata to align with ODW standards
• Reducing web archive silos
• Determining representative image




Use of Web Archive Data 


Archive-It Research Services (2014) 


Web Archive Datasets

• WAT Datasets (Web Archive Transformation): Key Metadata from Every Resource
• LGA Datasets (Longitudinal Graph Analysis): What Links to What over Time
• WANE Datasets (Web Archive Named Entities): Names of People, Places, Organizations


Archives Unleashed Project (2017-2020) 


• Est. 2017 with funding from the Andrew W. Mellon Foundation
• Recognizes the critical role of web archives for scholars studying the 1990s onward
  ○ Lower barriers to access & use of web archives
• Project Priorities:
  ○ Tool Development
  ○ Resource Support
  ○ Community Engagement

[Diagram: Archives Unleashed project areas. Tools: Toolkit, Cloud, Notebooks. Resources: Learning Guides, Video tutorials. Community: Datathon Series, Collaborations]


ARCH Features 


• Interactive, familiar environment for current Archive-It subscribers
• Transforms Archive-It collections into research datasets for analysis
• Generate and download over a dozen datasets
• Standardized dataset format as CSVs
• In-browser visualizations and data previews that present a glimpse into collection content

[Screenshot: completed jobs in ARCH for the collection "The Life of Aaron Swartz Analysis"]


What can you do with CSV files? 


→ What is your research question?
  • Text/language based?
  • Network?
  • Other
→ What tools/methods are you familiar with? Are there any you're interested in further exploring?
→ ARCH User Documentation provides some examples of using tools like Gephi, Voyant or methods like topic modelling or geoparsing with ARCH datasets
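
As a first step toward any of these questions, a downloaded CSV can be summarized with a few lines of Python. The sketch below counts captures per domain using only the standard library. The column names (`crawl_date`, `domain`, `url`) and the sample rows are assumptions made for illustration; check the header of the actual dataset before adapting it.

```python
import csv
import io
from collections import Counter

# Stand-in for a downloaded ARCH CSV; column names and rows are made up.
SAMPLE_CSV = """crawl_date,domain,url
20130226,example.org,http://example.org/
20130226,example.org,http://example.org/about
20130226,example.com,http://example.com/
"""


def count_domains(csv_text: str) -> Counter:
    """Count how many rows (captures) each domain has."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return Counter(row["domain"] for row in reader)


domain_counts = count_domains(SAMPLE_CSV)
for domain, count in domain_counts.most_common():
    print(domain, count)
```

For a real file, the same function could be fed the file's contents, and the resulting counts exported for use in a tool such as Gephi or Voyant.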


ARCH Demo 


ARCH Pilot

[Screenshot: ARCH view of the public collection "The Life of Aaron Swartz Analysis" (1 seed, crawled Feb 26, 2013; public collection link: https://archive-it.org/collections/03492), showing a job summary with no active jobs and a prompt to generate a new dataset]

Learn More: ARCH Documentation

Interested in working with ARCH in its pilot phase? Fill out this form and the team will be in touch.


Thank you! And Resources


• Community Webs Access Portal
• Archive-It Resources:
  ○ Metadata in Archive-It
  ○ Archive-It Integrations
• DPLA Resources:
  ○ Standardized Rights Statements
• OurDigitalWorld
• ARCH Documentation


